Database Oriented Chart Parsing
نویسنده
چکیده
We introduce the notion of parsing and translation using Database technology. The framework focuses on the notion of “Database Oriented parsing”. We describe the basic notion of linguistic representation using relational tables in SQL and we specify a set of operators on these tables for parsing and translation. The framework provides a flexible mechanism for modeling cross linguistic information about different languages in terms of relations and attributes. We are applying the model for parsing and translation of French sentences into English. In our work all the grammatical relations are specified as relations in SQL and constraints on word order are also defined in terms of relations. The work suggests a flexible multi-lingual approach to grammatical modeling and parsing.
منابع مشابه
Address Recognition with Robust NLU Technology
An analysis component recognizing addresses on printed documents and identifying the respective sender or recipient is presented. The component’s kernel is a standard chart parser for feature-based context-free grammars. The parsing procedure is enhanced with the island-parsing strategy and the ability for partial parsing. The needs of the realworld application within document analysis of print...
متن کاملObject-oriented parsing of biological databases with Python
MOTIVATION While database activities in the biological area are increasing rapidly, rather little is done in the area of parsing them in a simple and object-oriented way. RESULTS We present here an elegant, simple yet powerful way of parsing biological flat-file databases. We have taken EMBL, SWISSPROT and GENBANK as examples. EMBL and SWISS-PROT do not differ much in the format structure. GE...
متن کاملAMOS: A Natural Language Parser Implemented as a Deductive Database in LOLA
In this paper we present the set-oriented bottom-up parsing system AMOS which is a major application of the deductive database system LOLA. AMOS supports the morpho-syntactical analysis of old Hebrew and has now been operationally used by linguists for a couple of years. The system allows the declarative specification of Definite Clause Grammar rules. Due to the set-oriented bottom-up evaluatio...
متن کاملTFLEX: Speeding Up Deep Parsing with Strategic Pruning
This paper presents a method for speeding up a deep parser through backbone extraction and pruning based on CFG ambiguity packing.1 The TRIPS grammar is a wide-coverage grammar for deep natural language understanding in dialogue, utilized in 6 different application domains, and with high coverage and sentence-level accuracy on human-human task-oriented dialogue corpora (Dzikovska, 2004). The TR...
متن کاملAn Augmented Chart Data Structure with Efficient Word Lattice Parsing Scheme In Speech Recognition Applications
In this paper, an augmented chart data structure with efficient word lattice parsing scheme in speech recognition applications is proposed. The augmented chart and the associated parsing, algorithm can represent and parse very efficiently a lattice of word hypotheses produced in speech recognition with high degree of lexical ambiguity .without changing the fundamental principles of chart parsin...
متن کامل